Noise robust digit recognition using a glottal radar sensor for voicing detection
نویسندگان
چکیده
A voicing feature is used in concatenation to MFCC features to increase the performance of digit recognition at both low and high SNRs. The problem of noise robust extraction of the voicing feature is solved by using the glottal electromagnetic sensor (GEMS). The GEMS device provides reliable voicing information at all SNRs and noise environments. It is shown that although the voicing feature increases the performance for the clean speech case, the relative improvement for the noisy case is significantly higher for a digit recognition task. Our results indicate that the GEMS device can solve the fundamental problem of extracting reliable voicing information in noisy environments.
منابع مشابه
Multimodal Speaker Verification Based on Electroglottograph Signal and Glottal Activity Detection
To achieve robust speaker verification, we propose a multimodal method which includes additional nonaudio features and glottal activity detector. As a nonaudio sensor an electroglottograph (EGG) is applied. Parameters of EGG signal are used to augment conventional audio feature vector. Algorithm for EGG parameterization is based on the shape of the idealized waveform and glottal activity detect...
متن کاملRobust speech/non-speech detection based on LDA-derived parameter and voicing parameter for speech recognition in noisy environments
Every speech recognition system contains a speech/non-speech detection stage. Detected speech sequences are only passed through the speech recognition stage later on. In a very noisy environment, the noise detection stage is generally responsible for most of the recognition errors. Indeed, many detected noisy periods can be recognized as a vocabulary word. This manuscript provides solutions to ...
متن کاملNoise Robust Speech Recognition Using Prosodic Information
This paper proposes a noise robust speech recognition method for Japanese utterances using prosodic information. In Japanese, the fundamental frequency (F0) contour conveys phrase intonation and word accent information. Consequently, it also conveys information about prosodic phrase and word boundaries. This paper first proposes a noise robust F0 extraction method using the Hough transform, whi...
متن کاملRobust Fault Detection on Boiler-turbine Unit Actuators Using Dynamic Neural Networks
Due to the important role of the boiler-turbine units in industries and electricity generation, it is important to diagnose different types of faults in different parts of boiler-turbine system. Different parts of a boiler-turbine system like the sensor or actuator or plant can be affected by various types of faults. In this paper, the effects of the occurrence of faults on the actuators are in...
متن کاملVoicing parameter and energy based speech/non-speech detection for speech recognition in adverse conditions
In adverse conditions, the speech recognition performance decreases in part due to imperfect speech/non-speech detection. In this paper, a new combination of voicing parameter and energy for speech/non-speech detection is described. This combination avoids especially the noise detections in real life very noisy environments and provides better performance for continuous speech recognition. This...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004